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Ensifer meliloti WSM1022 is an aerobic, motile, Gram-negative, non-spore-forming rod that 
can exist as a soil saprophyte or as a legume microsymbiont of Medicago. WSM1022 was 
isolated in 1987 from a nodule recovered from the roots of the annual Medicago orbicularis 
growing on the Cyclades Island of Naxos in Greece. WSM1 022 is highly effective at fixing ni- 
trogen with M. truncatula and other annual species such as M. tornata and M. littoralis and is 
also highly effective with the perennial M. sativa (alfalfa or lucerne). In common with other 
characterized E. meliloti strains, WSM1022 will nodulate but fixes poorly with M. 
polymorpha and M. sphaerocarpos and does not nodulate M. murex. Here we describe the 
features of E. meliloti WSM1 022, together with genome sequence information and its annota- 
tion. The 6,649,661 bp high-quality-draft genome is arranged into 121 scaffolds of 125 
contigs containing 6,323 protein-coding genes and 75 RNA-only encoding genes, and is one 
of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Ge- 
nomic Encyclopedia for Bacteria and Archaea -Root Nodule Bacteria (GEBA-RNB) project. 



Introduction 

An available source of nitrogen (N) is essential to 
life on Earth. Although the atmosphere consists of 
approximately 80% N, the overwhelming propor- 
tion of this is present in the form of dinitrogen 
(N2) which is biologically inaccessible to the vast 
majority of higher organisms. Only a subset of mi- 
crobes has the necessary molecular machinery to 
make atmospheric N2 bioavailable by enzymatical- 
ly reducing N2 to NH3. The fact that plant growth is 
most commonly limited by the availability of N 
may have provided the selective pressure for a 
wide range of plant genera, most of which are leg- 
umes, to evolve a symbiotic relationship with the- 
se ISh-fixing microbes. These microsymbionts, col- 
lectively termed root nodule bacteria, receive a 
carbon source from the plant and in return supply 



the host with biologically fixed N. When these 
symbiotic interactions are optimally harnessed in 
agriculture, all the N-requirements of the host can 
be met, without the need to apply industrially syn- 
thesized N-based fertilizers, thereby increasing 
both the economic and environmental sustainabil- 
ity of the farming system [1]. 

Forage and fodder legumes play an integral role in 
sustainable farming practice, providing feed for 
stock while also enriching soil with bioavailable N. 
Worldwide, there are approximately 110 million 
ha of forage and fodder legumes under production 
[2], of which members of the Medicago genus 
comprise a considerable component. Two bacteri- 
al species, Ensifer meliloti and E. medicae are 
known to nodulate and fix N2 with Medicago spp. 
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[3], although they differ in their symbiotic proper- 
ties on some Medicago hosts. Specifically, while E. 
medicae can nodulate and fix N2 with M. murex, M. 
arabica and M. polymorpha, E. meliloti does not 
nodulate M. murex, does not fix with M. 
polymorpha and fixes N2 very poorly with M. ara- 
bica [4-6]. 

E. meliloti strain WSM1022 was isolated in 1987 
from a nodule collected from the annual M. orbicu- 
laris growing on the Cyclades Island of Naxos in 
Greece. E. meliloti WSM1022 is a highly effective 
microsymbiont of Medicago, forming efficient N2- 
fixing associations with the annual species M. 
littoralis and M. tornata [7]. In common with E. 
medicae WSM419 [8], WSM1022 also fixes ap- 
proximately twice as much N2 as E. meliloti 1021 
on the model legume M. truncatula All [7]. How- 
ever, unlike E. medicae WSM419, E. meliloti 
WSM1022 is also highly effective with the peren- 
nial M. sativa (alfalfa or lucerne) [7]. Therefore, E. 
meliloti WSM1022 is a broadly effective 
microsymbiont of Medicago spp. and as such rep- 
resents a unique tool for the molecular analysis of 
effective N2 fixation with fully sequenced macro- 
and microsymbionts. Here we present a summary 
classification and a set of general features for E. 
meliloti strain WSM1022 together with a descrip- 
tion of its genome sequence and annotation. 

Classification and features 

E. meliloti WSM1022 is a motile, Gram-negative 
rod (Figure 1 Left and Center) in the order 
Rhizobiales of the class Alphaproteobacteria. It is 
fast growing, forming colonies within 3-4 days 



when grown on half strength Lupin Agar (%LA) 
[9], tryptone-yeast extract agar (TY) [10] or a 
modified yeast- mannitol agar (YMA) [11] at 28°C. 
Colonies on %LA are white-opaque, slightly 
domed and moderately mucoid with smooth mar- 
gins (Figure IRight). 

Minimum Information about the Genome Se- 
quence (MIGS) is provided in Table 1. Figure 2 
shows the phylogenetic neighborhood of E. 
meliloti WSM1022 in a 16S rRNA sequence based 
tree. This strain shares 99.92% and 99.61% se- 
quence identity (over 1290 bp) to the 16S rRNA of 
the fully sequenced E. meliloti 1021 [29] and E. 
medicae WSM419 [8] strains, respectively. 

Symbiotaxonomy 

E. meliloti strain WSM1022 was isolated in 1987 
from a nodule collected from the annual M. orbicu- 
laris growing on the Cyclades Island of Naxos in 
Greece. The site of collection was a gentle slope 
and the soil a sandy-loam texture of pH 7.5-8.0. E. 
meliloti forms nodules (Nod + ) and fixes N2 (Fix + ) 
on a range of annual Medicago spp. as well as the 
perennial M. sativa (Table 2). In common with 
other characterized E. meliloti strains, WSM1022 
does not nodulate M. murex, does not fix N2 with 
M. polymorpha and M. arabica [4,5] and is a poorly 
effective microsymbiont of M. sphaerocarpos [11]. 
However, WSM1022 is broadly effective with the 
alkaline soil-adapted annuals M. littoralis and M. 
tornata as well as the widely grown perennial for- 
age legume M. sativa. In addition, WSM1022 is al- 
so a highly effective microsymbiont for the model 
legume M. truncatula A17. 





Figure 1. Images of Ensifer meliloti WSM1022 using scanning (Left) and transmission (Center) electron microscopy 
and the appearance of colony morphology on a solid medium (Right). 
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Table 1. Classification and general features of Ensifer meliloti WSM 102 2 according to the 
MIGS recommendations [12] 



MIGS ID 


Property 


Term 


Evidence code 






Domain Bacteria 


TAS [1 3] 






Phylum Proteobacteria 


TAS [14] 




Current classification 


Class Alphaproteobacteria 
Order Rhizobiales 
Family Rhizobiaceae 
Genus Ensifer 
Species Ensifer meliloti 


TAS [15,16] 
TAS [16,1 7] 
TAS [18,19] 
TAS [20-22] 
TAS [21] 




Gram stain 


Negative 


IDA 




Cell shape 


Rod 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Non-sporulating 


NAS 




Temperature range 


Mesophile 


NAS 




Optimum temperature 


28°C 


NAS 




Salinity 


Non-halophile 


NAS 


MIGS-22 


Oxygen requirement 


Aerobic 


TAS [7] 




Carbon source 


Varied 


NAS 




Energy source 


Chemoorganotroph 


NAS 


MIGS-6 


Habitat 


Soil, root nodule, on host 


TAS [7] 


MIGS-15 


Biotic relationship 


Free living, symbiotic 


TAS [7] 


MIGS-14 


Pathogenicity 


Non-pathogenic 


NAS 




Biosafety level 


1 


TAS [23] 




Isolation 


Root nodule 


TAS [11] 


MIGS-4 


Geographic location 


Naxos, Greece 


TAS [11] 


MIGS-5 


Soil collection date 


28 April 1987 


IDA 


MIGS-4.1 


Longitude 


37.107772 


IDA 


MIGS-4.2 


Latitude 


25.387841 




MIGS-4. 3 


Depth 


0-1 0cm 




MIGS-4.4 


Altitude 


Not recorded 





Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., 
a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not di- 
rectly observed for the living, isolated sample, but based on a generally accepted property 
for the species, or anecdotal evidence). These evidence codes are from the Gene Ontolo- 
gy project [24]. 
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57 



57 



72 



99 



Ensifer meliloti GVPV12 (Gi08912) 
Ensifer meliloti AK83 (Gc0 1 8 1 0)* 
Ensifer meliloti Sm1021 (Gc00059)* 
Ensifer meliloti SM11 (CP001830 Gc01686)* 
Ensifer meliloti Mlalz-1 (Gi08913) 
Ensifer meliloti RRI128 (Gi08915) 
Ensifer meliloti AK58 (Gi07577) 
Ensifer meliloti MVII-I (Gi08914) 
Ensifer meliloti CIAM1775 (Gi08844) 
Ensifer meliloti LMG 6133 T (X67222) 
Ens/fer me/f/otf WSM1 022 (Gi0891 6) 
— Ensifer meliloti (Gi08911) 

Ensifer medicae WSM419 (Gc00590)* 
Ensifer medicae WSM1369 (Gi08907) 
Ens/fer medicae WSM 1 1 1 5 (Gi08906) 
Ensifer medicae Di28 (Gi08905) 
Ensifer medicae WSM244 (Gi08916) 
Ensifer medicae WSM4191 (Gi08903) 
Ens/fer medicae A32 1 T ( L39882) 
Ensifer arboris LMG 14919 (Gi08822) (syn: HAMBI 1552 T ) 
Ensifer saheii LMG 7837 T (X68390) 



94 



71 



51 



62 



Ensifer kostiense LMG 19227 T (AM181748) 



Ensifer sp. TW10 (Gi08835) 
Ensifer sp. WSM 1721 (Gi08904) 



76 



Ensifer chiapanecum ITTG S70 (EU286550) 

Ensifer mexicanum ITTG R7 T (DQ41 1930) 
Ens/fer terangae LMG 7834 T (X68388) 



0.002 



Figure 2. Phylogenetic tree showing the relationship of Ensifer meliloti VVSM1022 (shown in bold print) to other 
Ensifer spp. in the order Rhizobiales based on aligned sequences of the 16S rRNAgene (1,290 bp internal region). All 
sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, 
version 5 [25]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model 
[26]. Bootstrap analysis [2 7] with 500 replicates was performed to assess the support of the clusters. Type strains are 
indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a 
GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [28]. Published genomes are 
indicated with an asterisk. 
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Table 2. Nodulation and N 2 fixation properties off. meliloti WSM1022 on selected Medicago spp. 
Data compiled from [7,1 1] + 



Cultivar or 
Species Name Accession 



Growth 

Habit Nodulation N 2 fixation Comment 



M. truncatula 



A17 



Annual Nod + 



Fix + 



Highly effective 



M. truncatula 



Jemalong 



Annual Nod + 



Fix + 



Highly effective 



M. truncatula 



Caliph 



Annual Nod + 



Fix + 



Highly effective 



M. littoral is 



Harbinger 



Annual Nod + 



Fix + 



Highly effective 



M. tornata 



Tornafield 



Annual Nod + 



Fix + 



Highly effective 



M. sphaerocarpos Orion 



Annual Nod + 



Fix + 



Poorly effective 



M. arabica 



SA36043 



Annual Nod + 



Fix- 



No fixation 



M. polymorpha Santiago 



Annual 



Nod + 



Fix- 



No fixation 



M. murex 



Zodiac 



Annual 



Nod" 



Fix" 



No nodulation 



M. sativa 



Sceptre 



Perennial Nod + 



Fix + 



Highly effective 



+ Note that '+' and '-' denote presence or absence, respectively, of nodulation (Nod) or N 2 fixation (Fix). 



Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the 
basis of its environmental and agricultural rele- 
vance to issues in global carbon cycling alterna- 
tive energy production, and biogeochemical im- 
portance, and is part of the Community Sequenc- 
ing Program at the U.S. Department of Energy, 
Joint Genome Institute (JGI) for projects of rele- 
vance to agency missions. The genome project is 
deposited in the Genomes OnLine Database [28] 
and an improved-high-quality-draft genome se- 
quence in IMG. Sequencing, finishing and annota- 



tion were performed by the JGI. A summary of the 
project information is shown in Table 3. 

Growth conditions and DNA isolation 

E. meliloti WSM1022 was cultured to mid loga- 
rithmic phase in 60 ml of TY rich medium [30] on 
a gyratory shaker at 28°C. DNA was isolated from 
the cells using a CTAB (Cetyl trimethyl ammonium 
bromide) bacterial genomic DNA isolation method 
[31]. 
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Table 3. Genome sequencing project information for E. meliloti WSM1 022. 



MIGS ID 


Property 


Term 


MIGS-31 


Finichincr ni 13 1 it\/ 
i 1 1 1 1 s 1 1 1 1 il; uudiny 


1 mn rm/^H h ro h-ni 13 1 it\/ dr3 ft 

Ml IUIUVCU llltill UUdllLy UldlL 


lv\ lUJ-ZO 


LIUIdlltJb UbtrU 


1 V Mill m I 113 \ lr\ 1*3 f\/ 
IX 1 1 1 U 1 1 1 1 1 Id llUldly 




^f»niifinr"ino r~\ afffinnc 

Ot;UUt;IIL.IIIt LJ Id U Ul 1 1 lb 


Illumine Hi^pn ?nnO 
iiiuiiniid i ii jcu z 


J VI 1 vj J J 1 . Z. 


Cpni i£*rif" i no rao£* 

Ot;UUt;llL.lllti L U V t; 1 at t; 


M i irn i na * 0 ~7 K v 

IllUlllllld. Z / Ja 


MIGS-30 


Assemblers 


Velvet version 1.1.04; Allpaths-LG version r42 328 


MIGS-32 


Gene callini? methods 

V_ J ^ 1 IV, V. 1 1 1 1 J 1 ICL 1 1 IV, LI 1 V_/ V_J JJ 


Prodigal 1.4, GenePRIMP 




GOLD ID 


Gi08916 




NCBI project ID 


782 33 




Database: IMG 


2510065057 




Project relevance 


Symbiotic N, fixation, agriculture 



Genome sequencing and assembly 

The genome of Ensifer meliloti WSM1022 was se- 
quenced at the Joint Genome Institute (JGI) using 
Illumina technology [32]. An Illumina standard 
shotgun library was constructed and sequenced 
using the Illumina HiSeq 2000 platform which 
generated 12,082,430 reads totaling 1812.4 Mbp. 

All general aspects of library construction and se- 
quencing performed at the JGI can be found at the 
JGI website [31]. All raw Illumina sequence data 
was passed through DUK, a filtering program de- 
veloped at JGI, which removes known Illumina 
sequencing and library preparation artifacts 
(Mingkun, L., Copeland, A. and Han, J., un- 
published). The following steps were then per- 
formed for assembly: (1) filtered Illumina reads 
were assembled using Velvet [33] (version 
1.1.04), (2) 1-3 kb simulated paired end reads 
were created from Velvet contigs using wgsim 
(https://github.com/lh3/wgsim), (3) Illumina 
reads were assembled with simulated read pairs 
using Allpaths-LG [34] (version r42328). Parame- 
ters for assembly steps were: 1) Velvet (velveth: 
63 -shortPaired and velvetg: -veryclean yes - 
exportFiltered yes -mincontiglgth 500 
scaffolding no-covcutoff 10) 2) wgsim (-e 0-1 
100 -2 100 -r 0 -R 0 -X 0) 3) Allpaths-LG 
(PrepareAllpathsInputs:PHRED64=l PL0IDY=1 
FRAGCOVE RAGE =125 JUMPCOVERAGE=25 
LONGJUMPCOV=50, RunAllpath-sLG: THREADS=8 
RU N=stdshredpair s TARGE T S=standard 

VAP I WARNO N LY =T rue OVERWRITE =True). The 
final draft assembly contained 125 contigs in 121 
scaffolds. The total size of the genome is 6.6 Mb 
and the final assembly is based on 1,812.4 Mbp of 



Illumina data, which provides an average 275* 
coverage of the genome. 

Genome annotation 

Genes were identified using Prodigal [35] as part 
of the DOE-JGI annotation pipeline [36]. The pre- 
dicted CDSs were translated and used to search 
the National Center for Biotechnology Information 
(NCBI) nonredundant database, UniProt, 
TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro 
databases. The tRNAScanSE tool [37] was used to 
find tRNA genes, whereas ribosomal RNA genes 
were found by searches against models of the ri- 
bosomal RNA genes built from SILVA [38]. Other 
non-coding RNAs such as the RNA components of 
the protein secretion complex and the RNase P 
were identified by searching the genome for the 
corresponding Rfam profiles using INFERNAL 
(http://infernal.janelia.org). Additional gene pre- 
diction analysis and manual functional annotation 
was performed within the Integrated Microbial 
Genomes (IMG-ER) platform [39]. 

Genome properties 

The genome is 6,649,661 nucleotides with 62.16% 
GC content (Table 4) and comprised of 121 scaf- 
folds (Figure 3) of 125 contigs. From a total of 
6,398 genes, 6,323 were protein encoding and 75 
RNA only encoding genes. The majority of genes 
(80.78%) were assigned a putative function whilst 
the remaining genes were annotated as hypothet- 
ical. The distribution of genes into COGs functional 
categories is presented in Table 5. 
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Table 4. Genome Statistics for Ensifer me/,/of/ WSM1022 



Attribute 



Value % of Total 



V_J \ , 1 1 y_j l l j J z_ v_ \ y 


6 649 661 


100.00 


ONA rorlinp rppion fhrO 

i y i i/ \ luui i it i c iui i \ u / 


5 733 017 


86.22 


DNA G+C content (hnl 


4 1 33 661 


62.1 6 


Number of scaffolds 


121 




Nliimhpr of rnntipc: 


125 




Tofpi 1 p pnp 

1 W LCI 1 fcl V. 1 1 V, 


6, 398 


100.00 


RNA ppnps 


75 


1.17 


rRNIA onprons 




0.02 


Protein-coding genes 


6, 323 


98.83 


Genes with function prediction 


5,168 


80.78 


Genes assigned to COGs 


5,147 


80.45 


Genes assigned Pfam domains 


5,331 


83.32 


Genes with signal peptides 


563 


8.80 


Genes with transmembrane helices 


1,437 


22.93 


CRISPR repeats 


0 





SINM1022DRAFT scaffold 0.1 



r-n r— r 



SINM1022DRAFT_scaffold_5.6 



I I 



I III II 
I .» ■ 1 



SINM1022DRAFT scaffold 1.2 



rJ-J 



SINM1022DRAFT scaffold 6.7 



SIN Ml 02 2 DRAFT scaffold 2.3 



I 111111111 in i ■ hi Hi i mil in 

I I , ,,, I I 1 J , , ,,1, II 



SINM1022DRAFT scaffold 3.4 



SINM1022DRAFT_scaffold_4.5 



bwww mm 

>iii 



COO Co* COOFunt»oiiO«!m*on 


■ 1*1 


RNA procasiaig and modification 


IB] 


Crtromavn structure and dynamics 


■ |C| 


■Mm t ■ t-Ji — | Qfl j Mn Ml 


■ |0| 


Cat cyd* control, col Ovtswn. cnromosome partojonmg 


■ IE! 


Amsvo aod transport and rnetaboasm 


■ |FI 


Nucmoikm transport and rnataboasm 


IOI 


Caibonydrata crampon and niatatiimin 


■ IK 


Coenzyme transport and metarxxism 


■ in 


Udh3 transport and matapoaam 


■ U! 


Translator rtboaomal slruduro and taogonests 


■ IK] 


Transcnpbon 


M 


Replication recome-nation and repaa 


■ |M| 


Caa watrrnanvoranawrvalopa biogenesis 


IN] 


I .-■ MtJI 


■ 10) 


Poittranstabonal iltodMkllton. ptoaMt tumovar chaperorves 


■ IP] 


Inorganic ion transport and motapoasm 


|Q> 


avxptidaty motaooatas bwayntnessv transport and cataboesm 


■ |R| 


General tuncbon predKbon only 


■ IS) 


Function unknown 


■ m 


Signal transducson rrvectionrsms 


|U| 


Inrreceftaar baftlclung secretion and vesicular transport 


M 


Oetense mecneivrsms 


■ |W] 


Extrecettutar structures 


■ m 


Nuclear structure 


■ (Z) 


Cytoskoloton 


■ INA] 


Not Assigned 



Figure 3. Graphical map of thegenome of Ensifer /ne,Wof/WSM1022 showing the seven 
largest scaffolds. From bottom to the top of each scaffold: Genes on forward strand 
(color by COG categories as denoted by the IMG platform), Genes on reverse strand 
(color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), 
GC content, GC skew. 
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Table 5. Number of protein coding genes of Ensifer meliloti WSM1 022 associated with the 
general COG functional categories. 



Code 


Value 


% age 


COG Category 


J 


194 


3.38 


Translation, ribosomal structure and biogenesis 


A 


0 


0.00 


RNA processing and modification 


K 


497 


8.65 


Transcription 


L 


196 


3.41 


Replication, recombination and repair 


B 


1 


0.02 


Chromatin structure and dynamics 


D 


38 


0.66 


Cell cycle control, mitosis and meiosis 


Y 


0 


0.00 


Nuclear structure 


V 


61 


1.06 


Defence mechanisms 


T 


235 


4.09 


Signal transduction mechanisms 


M 


301 


5.24 


Cell wall/membrane biogenesis 


N 


71 


1.24 


Cell motility 


Z 


0 


0.00 


Cytoskeleton 


W 


1 


0.02 


Extracellular structures 


u 


113 


1.97 


Intracel I ular trafficking and secretion 


o 


177 


3.08 


Posttranslational modification, protein turnover, chaperones 


c 


357 


6.21 


Energy production conversion 


G 


606 


10.54 


Carbohydrate transport and metabolism 


E 


623 


10.84 


Amino acid transport metabolism 


F 


109 


1.90 


Nucleotide transport and metabolism 


H 


200 


3.48 


Coenzyme transport and metabolism 


I 


207 


3.60 


Lipid transport and metabolism 


P 


312 


5.43 


Inorganic ion transport and metabolism 




I JO 


Z . / D 


DtfCUNUdry ITIfcJLdUOl I lc UlUbyilLNcMS, LldNbpOrL dNU LdLdUUIIblll 


R 


708 


12.32 


General function prediction only 


S 


583 


10.14 


Function unknown 




1,251 


19.55 


Not in COGS 


Total 


5,748 







Acknowledgements 

This work was performed under the auspices of the US 
Department of Energy's Office of Science, Biological and 
Environmental Research Program, and by the Universi- 
ty of California, Lawrence Berkeley National Laborato- 
ry under contract No. DE-AC02-05CH11231, Lawrence 
Livermore National Laboratory under Contract No. DE- 
AC52-07NA27344, and Los Alamos National Laborato- 
ry under contract No. DE-AC02-06NA25396. We grate- 



fully acknowledge the funding received from the Mur- 
doch University Strategic Research Fund through the 
Crop and Plant Research Institute (CaPRI) and the Cen- 
tre for Rhizobium Studies (CRS) at Murdoch Universi- 
ty. We also acknowledge ECR funding for J. Terpolilli 
awarded by the School of Veterinary and Life Sciences 
at Murdoch University. 



322 



Standards in Genomic Sciences 



Terpolilli et al. 



References 

1. HowiesonJG, O'Hara GW, CarrSJ. Changing roles 
for legumes in Mediterranean agriculture: develop- 
ments from an Australian perspective. Field Crops 
Res 2 000; 65:107-122. 

http://dx.doi .org/1 0. 1 01 6/S0378-42 90(99)00081 -7 

2. Herridge DF, Peoples MB, Boddey RM. Global in- 
puts of biological nitrogen fixation in agricultural 
systems. Plant Soil 2008; 311:1-18. 
http://dx.doi.org/10.1007/sin04-008-9668-3 

3. Graham P. Ecology of the root-nodule bacteria of 
legumes. In: Dilworth MJ, James EK, Sprent Jl, New- 
ton WE, editors. Nitrogen-Fixing Leguminous Sym- 
bioses. Dodrecht: The Netherlands: Springer; 2008. 
p 2 3-43. 

4. Garau G, Reeve WG, Brau L, Yates RJ, James D, 
Tiwari R, O'Hara GW, HowiesonJG. The symbiotic 
requirements of different Medicago spp. suggestthe 
evolution of Sinorhizobium meliloti and S. medicae 
with hosts differentially adapted to soil pH. Plant 
Soil 2005; 276:2 63-2 77. 

http ://dx. doi .org/1 0. 1 007/sl 11 04-005-03 74-0 

5. Rome S, Cleyet-Marel JC, Materon LA, Normand P, 
Brunei B. Rapid identification of Medicago 
nodulating strains by using two oligonucleotide 
probes complementary to 1 6S rDNA sequences. 
Can I Microbiol 1 997; 43:854-861 . Pub Med 
http://dx.doi .org/1 0. 1 1 39/m97-1 2 4 

6. Brunei B, Rome S, Ziani R, Cleyet-Marel JC. Com- 
parison of nucleotide diversity and symbiotic prop- 
erties of Rhizobium meliloti populations from annual 
Medicago species. FEMS Microbbl Ecol 1 996; 
19:71-82. http://dx.doi.Org/10.l111/j.1 574- 

6941. 1996.tb002 00.x 

7. Terpolilli J J, O'Hara GW, Tiwari RP, Dilworth MJ, 
HowiesonJG. The model legume Medicago 
truncatula A1 7 is poorly matched for N 2 fixation 
with the sequenced microsymbiont Sinorhizobium 
meliloti 1 02 1 . New Phytol 2 008; 1 79: 62 -66. Pub- 
Med http://dx.doi.org/10.11 1 1/j.l 469- 
8137.2008.02464.x 

8. Reeve W, Chain P, O'Hara G, Ardley J, Nandesena 
K, Brau L, Tiwari R, Malfatti S, Kiss H, Lapidus A, et 
al. Complete genome sequence of the Medicago 
microsymbiont Ensifer (Sinorhizobium) medicae 
strain WSM419. Stand Genomic Sc/ 2 010; 2:77-86. 
PubMed http://dx.doi.org/10.4056/sigs.43526 

9. HowiesonJG, Ewing MA, D'antuono MF. Selection 
for acid tolerance in Rhizobium meliloti. Plant Soil 
1988; 105:179-188. 

http://dx.doi .org/1 0. 1 007/BF02 376781 



http://standardsingenomics.org 



10. BeringerJE. R factor transfer in Rhizobium 
leguminosarum. J Gen Microbbl \ 97A; 84:188-198. 
PubMed http ://dx. doi .org/1 0. 1 099/0022 1 2 87-84-1 - 
188 

1 1 . Terpolilli JJ. Why are the symbioses between some 
genotypes of Sinorhizobium and Medicago subop- 
timal for N 2 fixation? Perth: Murdoch University; 
2009. 223 p. 

12. Field D, Garrity G, Gray T, Morrison N, SelengutJ, 
Sterk P, Tatusova T, Thomson N, Allen M, Angiuoli 
SV, etal. Towards a richer description of our com- 
plete collection of genomes and metagenomes 
"Minimum Information abouta Genome Sequence " 
(MIGS) specification. Nat Bbtechnol 2008; 26:541- 
547. PubMed http://dx.doi.Org/1 0. 1 038/nbtl 360 

13. Woese CR, Kandler O, Wheel is ML. Towards a nat- 
ural system of organisms: proposal forthe domains 
Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci 
USA 1990; 87:4576-4579. PubMed 

http ://dx . doi . org/ 1 0 . 1 073/p na s . 8 7. 1 2 . 45 76 

14. Garrity GM, Bell JA, Lilburn T. Phylum XIV. 
Proteobacteria phyl. nov. In: Garrity GM, Brenner 
DJ, Krieg NR, StaleyJT (eds), Bergey's Manual of 
Systematic Bacteriology, Second Edition, Volume 2, 
Part B, Springer, New York, 2005, p. 1. 

15. Garrity GM, Bell JA, Lilburn T. Class I. 
Alphaproteobacteria class. In: Garrity GM, Brenner 
DJ, Kreig NR, StaleyJT, editors. Bergey's Manual of 
Systematic Bacteriology. Second ed: New York: 
Springer - Verlag; 2005, p. 1. 

16. Validation List No. 107. List of new names and new 
combinations previously effectively, but notvalidly, 
published. IntJ SystEvol Microbiol 2006; 56:1-6. 
PubMed http://dx.doi .org/1 0. 1 099/ijs.0.641 88-0 

1 7. Kuykendall LD. Order VI. Rhizobiales ord. nov. In: 
Garrity GM, Brenner DJ, Krieg NR, StaleyJT (eds), 
Bergey's Manual of Systematic Bacteriology, Second 
Edition, Volume2, PartC, Springer, New York, 
2005, p. 324. 

1 8. Skerman VBD, McGowan V, Sneath PHA. Ap- 
proved Lists of Bacterial Names. IntJ Syst Bacteriol 
1980; 30:225-420. 

http://dx.doi.org/10.1099/00207713-30-1-225 

19. Conn HJ. Taxonomic relationships of certain non- 
sporeforming rods in soil. ] Bacteriol 1938; 36:320- 
321. 

20. Casida LE. Ensifer adhaerens gen. nov., sp. nov.: a 
bacterial predator of bacteria in soil. IntJ Syst 
Bacteriol 1982; 32:339-345. 
http://dx.doi.org/10.1099/002 07713-32-3-339 

32 3 



E nsifer meliloti strain WSM1022 



2 1 . Young JM. The genus name Ensifer Casida 1 982 
takes priority over Sinorhizobium Chen etal. 1988, 
and Sinorhizobium morelense Wang etal. 2002 is a 
later synonym of Ensifer adhaerens Casida 1982. Is 
the combination Sinorhizobium adhaerens (Casida 
1982) Willemset al. 2003 legitimate? Request for an 
Opinion. IntJ SystEvol Microbbl 2 003; 53:2107- 
2110. Pub Med 

http://dx.doi.Org/10.1099/ijs.0.02665-0 

22. Judicial Commission of the International Committee 
on Systematics of Prokaiyotes. The genus name 
Sinorhizobium Chen et al. 1988 is a later synonym 
of Ensifer Casida 1982 and is not conserved over the 
latter genus name, and the species name 
'Sinorhizobium adhaerens' is not validly published. 
Opinion 84. Int J Syst Evol Microbbl 2 008; 
58:1973. PubMed 

http://dx.doi .org/1 0. 1 099/ijs.0.2 008/005991 -0 

23. GublerM, Hennecke H, Fix A. B and Cgenesare 
essential for symbiotic and free-living, microaerobic 
nitrogen fixation. FEB S Lett 1986; 200:186-192. 
http://dx.doi.Org/1 0.1 01 6/0014-5793(86)80536-1 

24. AshburnerM, Ball CA, Blake JA, Botstein D, Butler 
H, Cherry JM, Davis AP, Dolinski K, DwightSS, 
Eppig JT, etal. Gene ontology: tool for the unifica- 
tion of biology. The Gene Ontology Consortium. 
Nat Genet 2000; 25:2 5-29. PubMed 
http://dx.doi .org/1 0. 1 038/75556 

25. Tamura K, Peterson D, Peterson N, Stecher G, Nei 
M, Kumar S. MEGA5: Molecular Evolutionary Ge- 
netics Analysis using Maximum Likelihood, Evolu- 
tionary Distance, and Maximum Parsimony Meth- 
ods. MolBiol Evol 2011; 28:2 731-2 739. PubMed 
http://dx.doi .org/1 0. 1 093/molbev/msr1 2 1 

2 6. Nei M, Kumar S. Molecular Evolution and 

Phylogenetics. New York: Oxford University Press; 
2000. 

2 7. Felsenstein J. Confidence limits on phylogenies: an 
approach using the bootstrap. Evolution 1985; 
39:783-791. http://dx.doi.Org/1 0.2 307/2 408678 

28. Liolios K, Mavromatis K, Tavernarakis N, Kyrpides 
NC. The Genomes On Line Database (GOLD) in 
2007: status of genomic and metagenomic projects 
and their associated metadata. Nucleic Acids Res 
2008; 36:D475-D479. PubMed 
http://dx.doi.org/10.1093/nar/gkm884 

29. GalibertF, FinanTM, Long SR, PuhlerA, AbolaP, 
Ampe F, Barloy-Hubler F, BarnettMJ, Becker A, 
Boistard P, et al. The composite genome of the leg- 
ume symb iont Sinorhizobium meliloti. Science 



2 001 ; 293: 668-672 . PubMed 
http://dx.doi.org/10.1126/science.1060966 

30. Reeve WG, Tiwari RP, Worsley PS, Di I worth MJ, 
Glenn AR, HowiesonJG. Constructs for insertiona I 
mutagenesis, transcriptional signal localization and 
gene regulation studies in root nodule and other 
bacteria. Microbiology 1999; 145:1307-1 31 6. 
PubMed http ://dx.doi .org/1 0. 1 099/1 3500872 -1 45- 
6-1 307 

31 . DOE Joint Genome Institute user home. 

http://my.jgi.doe.gov/general/index.html 

32. Bennett S. Solexa Ltd. Fharmacogenomics 2004; 
5:433-438. PubMed 

http://dx.doi.Org/10.1517/14622416.5.4.433 

33. Zerbino DR. Using the Velvet de novo assembler for 
short-read sequencing technologies. Current Proto- 
cols in Bioinformatics2010;Chapter 1 1 : Unit 1 1 5. 

34. Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, 
Burton JN, Walker BJ, Sharpe T, Hall G, Shea TP, 
SykesS, etal. High-quality draft assemblies of 
mammalian genomes from massively parallel se- 
quence data. Proc Natl Acad Sci USA 201 1; 
108:1513-1518. PubMed 

http ://dx.doi .org/1 0. 1 073/pnas. 1 01 7351 1 08 

35. Hyatt D, Chen GL, LocascioPF, Land ML, Larimer 
FW, Hauser LJ. Prodigal: prokaryotic gene recogni- 
tion and translation initiation site identification. 
BMC Bioinformatics 2010; 11:119. PubMed 

http ://dx.doi .org/1 0. 1 1 86/1 471-21 05-1 1-119 

36. Mavromatis K, Ivanova NN, Chen IM, Szeto E, Mar- 
kowitz VM, Kyrpides NC. The DOE-JGI Standard 
operating procedure for the annotations of microbial 
genomes. Stand Genomic Sci 2009; 1:63-67. Pub- 
Med http://dx.doi.Org/1 0.4056/sigs.632 

37. Lowe TM, Eddy SR. tRNAscan-SE: a program for 
improved detection of transfer RNA genes in ge- 
nomic sequence. Nucleic Acids Res 1997; 25:955- 
964. PubMed 

38. Pruesse E, Quast C, Knittel K. Fuchs BdM, Ludwig 
W, PepliesJ, GlocknerFO. SILVA: a comprehensive 
online resource for quality checked and aligned ri- 
bosomal RNA sequence data compatible with ARB. 
Nucleic Acids Res2007; 35:7188-7196. PubMed 
http://dx.doi.org/10.1093/nar/gkm864 

39. Markowitz VM, Mavromatis K, Ivanova NN, Chen 
IM, Chu K, Kyrpides NC. IMG ER: a system for mi- 
crobial genome annotation expert review and 
curation. Bbinformatics 2009; 25:22 71-22 78. 
PubMed 

http ://dx .doi .org/1 0. 1 093/bi oi nformatics/btp39 3 



324 



Standards in Genomic Sciences 



